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Abstract 

In analysing a well-known hash-coding method, Knuth gave an 
exact expression for the average number of rejections encountered by 
players of a variant of musical chairs. We study a variant more closely 
related to musical chairs itself and deduce the same expression by a 
purely combinatorial approach. 

In an analysis of the average time to insert an item when using open- 
address hash-coding, Knuth [H p. 528-530] reduced the problem to the fol- 
lowing question about musical chairs. Given m chairs arranged in a circle and 
numbered clockwise from to m — 1, if in turn each of n people arrives at a 
randomly selected chair (his initial chair) and walks clockwise until he finds 
an empty chair (his final chair), what is the average number of rejections 
(chairs found occupied during the search) per player? 

For m > n, Knuth showed that the average number of rejections is 




where n l = the number of ways of choosing from n items an ordered 

subset of i items. Knuth's analysis is very complex and gives no clue as to 
why the answer is so simple. Knuth appreciated this issue and asked for an 
explanation [2]. 

In the following analysis we reduce the problem yet further, to a game 
even closer to musical chairs, in which the average number of rejections is 
the same as in Knuth's game. We establish a one-to-one correspondence 
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between the set of all rejections encountered, over the whole sample space, 
and a set whose cardinality is easily established. In this way we eliminate all 
of the involved algebra of Knuth's proof. Regrettably, despite the absence of 
algebra, the following combinatorial proof is not as simple as the author first 
envisaged when he undertook the search for it. A simpler proof would be a 
valuable contribution to this facet of the theory of hash coding. 

In musical chairs, the players move around until the music stops, at which 
time they all rush for chairs. To change Knuth's problem to resemble more 
closely this situation we shall require all n people to have arrived before 
letting them search for their chairs. In effect, the people are partitioned into 
m labelled blocks (the chairs providing the labels), some of which may be 
empty. All blocks then travel around the chairs clockwise at the same speed, 
each losing one member to each vacant chair it passes until it becomes empty. 
This process is the seating process. To resolve seating conflicts, we rank the 
players in advance. The reader should have little difficulty in verifying that 
this version of musical chairs involves the same average number of rejections 
per player as does Knuth's version. 

For our analysis, we shall count the total number R of rejections suffered 
by all players, summed over all possible samples (assignments of players to 
initial chairs). The average is then (we must assume the players are 
distinguishable), which implies that R should be 

\{nhn n - x + nhn n - 2 + rAn n ~ z + ...) 

We say that a sample S of n players matches a sub-sample T of k out of 
those n players, for k < n, when every member of the j-th block of T is a 
member of the j-th block of S, for j = 0, 1, ...,m — I. When a sub-sample 
P of k out of our n players has a block of two people assigned to chairs c, 
k — 2 blocks, of one person each, assigned to chairs c + 1, c + 2, c + k — 2 
(addition mod m), and no other blocks, we call P a k-pattern. Exactly m n ~ k 

k 

samples match any given /c-pattern, and there are possible /c-patterns, 
taking rotations into account. Hence the total number of possible matches 
of samples with A;-patterns, for k > 2, is 

-(nhn 11 - 1 + nhn 11 - 2 + vhn n ~ z + ...), 
2 

which equals the total number of rejections given by Knuth's analysis. We 
shall give a one-to-one correspondence between rejections and matches, which 
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immediately gives us an alternative derivation of this expression for the num- 
ber of rejections. 

Given a sample S and a particular rejection, say of player a at the seat 
containing player z, we shall construct a particular match between some 
sample T and some pattern P. We shall form T from S merely by relabelling 
the blocks of S, that is, by permuting them with respect to the chairs. The 
permutation amounts to gathering together a certain subset of the blocks, 
called the distinguished blocks. The block containing player a (call it bi) 
remains where it is, at chair c, and is the first distinguished block. The 
other distinguished blocks of S (call them 6 2 , &3, &fc for some k > 1) are 
slid around the circle counter-clockwise by interchanging them with the non- 
distinguished blocks, until blocks b±, b 2 , b^ occupy chairs c, c+1, c+k — 1 
respectively. The undistinguished blocks occupy the remaining chairs. Note 
that, among themselves, the k distinguished blocks retain their relative initial 
seating order, as do the m — k undistinguished blocks. 

We now specify the distinguished blocks. Block bi has already been spec- 
ified. Block b i+ i is specified by the following procedure. 

1. If block bi contains player z, then we are done and k = i. We refer to 
player z as 

2. Otherwise, let di be the last chair to which bi lost a member (call him 
Pi) before bi (possibly exhausted) encountered z's final chair. Then b i+ i 
is the block whose initial chair is di + 1. 

This procedure completes the description of the construction of T. Note 
that there is a question as to whether di is always defined, which is dealt 
with now. 

In the following we let [a, b] denote the set of chairs {a, a + 1, a + 2, b}, 
the addition being modulo m. We let (a, b] = [a, b] — {a}, [a, b) = [a, b] — {b}, 
and (a, b) = [a, b] — {a, b}. We let b_i denote the initial chair of bi, and z the 
final chair of z. A player sits in a set S of chairs when his final chair is in 
S. A block of players sits in S when some player in that block sits in S. A 
set C of chairs sits in S when some block whose initial chair is in C sits in 
S. The dual of "sits" is sits only; a block sits only in S when all players in 
the block sit in S, and a set C of chairs sits only in S when all blocks whose 
initial chairs are in C sit only in S. 

The ftj's determined by the procedure satisfy the following lemmas and 
theorem. 
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Lemma 1 1. [&!,&&) and [b k ,z\ are disjoint. 

Proof. If not, then 6 X G [b k ,z\ But then a, which is in b±, would not be 
rejected by z when a reached z, a contradiction. □ 

Lemma 2 1. \h_i,b k ) does not sit in \k k ,z\- 

Proof. Each chair in [b k ,z\ is visited by an un-exhausted b k (because it con- 
tains z) before being visited by any block initially in [&!,& fc ), by Lemma 
1. □ 

Theorem 3 1. 

(i) bi is non-empty for 1 < i < k. 
(ii) di G [bi,b k ) for 1 < i < k. 

(Hi) does not sit in (di,b k ] for 1 < i < k during the seating process, 

(iv) b i+1 G (6j,6 fe ] forl<i<k. 
Proof. We use induction on i. 

1. When % — 1, b\ contains a. When % > 1, b^ G by (iv). Hence a 
is rejected at bi as it progresses towards z. The player seated at ^ must 
have originated in [ft^&j] for a to be rejected. By (iii) he cannot have 
originated in [6 X , c?i_i] , which leaves only b^ Hence bi is non-empty. 

2. By (i), di exists. By the procedure , di G [b_i,z\. By Lemma 2 and (iv), 
di ^ [fefc,^]- Hence di G 

3. By the procedure, bi does not sit in (di,b k ]. Also (b_i,di\ sits only in 
(6j, dj), otherwise 6i would not sit in di. So [bi, di\ does not sit in (di, b k ]. 
By (iii), [&i, c?i-i] does not sit in (di-i,b k ], and a portion does not sit 
in (di,b k ] by (ii). (For convenience take [&i>^o] to be empty.) Hence 

does not sit in (di,b k ]. 

4. This follows directly from (ii) and the procedure, which makes b i+1 = 
di + 1. 

□ 

Corollary 4 1. The procedure halts. 
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Proof. By (iv) of Theorem 3, as i increases bi gets closer to b k . □ 

We have now established that the distinguished blocks bi,b 2 ,---,b k are 
well-defined by the procedure. Hence the sample T can now be constructed, 
by grouping together the distinguished blocks of S as described earlier. 

We construct the (k + l)-pattern P to be matched by T thus. Player a is 
assigned to chair c (b\ 's initial chair in both S and T). For each distinguished 
block bi we assign player pi (as defined in the procedure) to chair c + i — 1. 
This completes the construction of P. It is trivial to verify that T matches 
P. 

We have thus far exhibited a map from rejections to matches. To see 
that the map is a bijection, we show that its inverse is totally and uniquely 
defined. 

Given a sample T that matches a (k + l)-pattern P, we show how to 
reconstruct the corresponding sample S. The pattern serves to identify the 
blocks 61, b 2 , b k - To form S, we merge these blocks with the remaining 
m — k undistinguished blocks of T. We leave block b\ where it is, in chair 
c. Let player a (the one who suffers the rejection we are constructing) be 
the second-ranked of the two players in P at chair c, the other of which we 
call p\. We assign undistinguished (possibly empty) blocks, in the order they 
appear directly following bk in T, to chairs c + 1, c + 2, ... until bi and b 2 are 
sufficiently far apart that in the seating process for S player pi will be seated 
before b\ arrives at 62 's initial chair. (Recall that for seating purposes we 
ranked the players in advance.) This fixes the position for b 2 . We now insert 
further undistinguished blocks from T between b 2 and 63, to seat player p 2 
(the player in chair c+ 1 in pattern P), and so on until the position for b k is 
determined. Player p k then becomes player z. It should be clear that when 
m > n this procedure will never require more than the available number of 
undistinguished blocks. This completes the demonstration of a one-to-one 
correspondence between rejections and matches. 

In conclusion, we have characterized the rejection aspect of musical chairs 
in terms of a reasonably natural correspondence with "matching chairs" , in 
which we are interested in matches between samples and patterns, rather than 
in rejections of players by occupied chairs. The correspondence is arrived at 
by the way of an "un-merging" of blocks which is specified by using the 
notion of "last player seated before rejection." 
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